Modeling Pitch Contour of Chinese Mandarin Sentence with PENTA Model
نویسنده
چکیده
In continuous speech, it is believed that the pitch contour of the same syllable may vary a lot due to its different context information. To apply the Parallel Encoding and Target Approximation (PENTA) model to Mandarin speech synthesis and improve its prediction accuracy, this paper proposed a method to predict pitch contours for Chinese syllables with different contexts by combining the Classification And Regression Tree (CART) with the PENTA model. We first used CART to cluster syllables’ normalized pitch contours according to the context information of syllables and the distances between pitch contours. For each cluster, we calculated the average pitch contour and trained the PENTA model with this average contour. The initial pitch value is required while using PENTA model to predict a continuous pitch contour. We further proposed a Pitch Discontinuity Model (PDM) to predict such initial pitch values at positions where voiceless consonants and prosodic boundaries are found. We first conducted experiments on a Chinese four-syllable word corpus containing 2048 items and then extended experiments to a continuous speech corpus containing 5445 sentences. The results were satisfactory in terms of the Root Mean Square Error (RMSE) values comparing the predicted pitch contour with the original contour. With this method, we can model pitch contours for Mandarin sentences of any text and apply the trained model parameters into speech synthesis.
منابع مشابه
Modeling Pitch Contour of Chinese Mandarin Sentences with the PENTA Model
In continuous speech, the pitch contour of the same syllable may vary much due to its contextual information. The Parallel Encoding and Target Approximation (PENTA) model is applied here to Mandarin speech synthesis with a method to predict pitch contours for Chinese syllables with different contexts by combining the Classification And Regression Tree (CART) with the PENTA model to improve its ...
متن کاملPerception of intonation in Mandarin Chinese.
There is a tendency across languages to use a rising pitch contour to convey question intonation and a falling pitch contour to convey a statement. In a lexical tone language such as Mandarin Chinese, rising and falling pitch contours are also used to differentiate lexical meaning. How, then, does the multiplexing of the F(0) channel affect the perception of question and statement intonation in...
متن کاملHakka Pitch-contour Parameter Generation Using a Mandarin-trained Pitch-contour Model
In this paper, using an existing pitch-contour model of a Chinese dialect (Mandarin here) to generate pitch-contour parameters for another dialect’s sentences (Hakka here) is studied. This can be generally viewed as a pitch-contour model adaptation problem. We study this problem in hope to save tedious labors and research time needed to build a pitchcontour model for a specific Chinese dialect....
متن کاملA Sentence-pitch-contour Generation Method Using Vq/hmm for Mandarin Text-to-speech
In this paper, a method with sentence-wide optimization consideration is proposed to generate a Mandarin sentence's pitch-contour. The developed model is called the sentence pitch-contour HMM (SPC-HMM) due to its use of VQ (vector quantization) and HMM (hidden Markov model). To construct an SPC-HMM, the pitch-contours of the syllables from each training sentence are normalized on both time and ...
متن کاملIncorporating Pitch Features for Tone Modeling in Automatic Recognition of Mandarin Chinese
Tone plays a fundamental role in Mandarin Chinese, as it plays a lexical role in determining the meanings of words in spoken Mandarin. For example, these two sentences R R (I like horses) and R M (I like to scold) differ only in the tone carried by the last syllable. Thus, the inclusion of tone-related information through analysis of pitch data should improve the performance of automatic speech...
متن کامل